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DETAILED ACTION 

1. Claims 1-55 have been examined. 

Papers Submitted 

2. It is hereby acknowledged that the following papers have been received and placed of 
record in the file: RCE, Amendment, and IDS as received on 8/18/2006. 

Continued Examination Under 37 CFR 1.114 

3. A request for continued examination under 37 CFR LI 14, including the fee set forth in 
37 CFR 1.17(e), was filed in this application after final rejection. Since this application is 
eligible for continued examination under 37 CFR 1.1 14, and the fee set forth in 37 CFR 1.17(e) 
has been timely paid, the finality of the previous Office action has been withdrawn pursuant to 
37 CFR 1.114. Applicant's submission filed on August 18, 2006, has been entered. 



Specification 

4. The disclosure is objected to because of the following informalities: On page 23, line 16, 
lines 16-18 are grammatically incorrect. Should the dash between "delay" and "time" be 
replaced with a period? 

Appropriate correction is required. 
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Claim Objections 

5. Claim 1 is objected to because of the following informalities: In line 4, remove "a". 
Appropriate correction is required. 

6. Claim 5 is objected to because of the following informalities: In line 5, replace "a" with 
--an--. Appropriate correction is required. 

7. Claim 7 is objected to because of the following informalities: In line 2, insert 
-scheduled- after "said" for increased clarity. Appropriate correction is required. 

8. Claim 17 is objected to because of the following informalities: In line 4, remove "a". 
Appropriate correction is required. 

9. Claim 36 is objected to because of the following informalities: In line 3, insert 
-scheduled- before "HRT" for increased clarity. Appropriate correction is required. 

10. Claim 48 is objected to because of the following informalities: In line 2, replace "the 
said" with either -the- or -said--. Appropriate correction is required. 

Claim Rejections - 35 USC § 112 

1 1 . The following is a quotation of the first paragraph of 35 U.S.C. 1 12: 

The specification shall contain a written description of the invention, and of the manner and process of making 
and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make and use the same and shall set forth the best mode 
contemplated by the inventor of carrying out his invention. 

12. Claims 2-18 are rejected under 35 U.S.C. 112, first paragraph, as failing to comply with 
the enablement requirement. The claim(s) contains subject matter which was not described in 
the specification in such a way as to enable one skilled in the art to which it pertains, or with 
which it is most nearly connected, to make and/or use the invention. More specifically, 
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regarding claim 17, the examiner does not understand how one could make or use an invention 
that switches threads between consecutive instruction cycles. As is known, an instruction cycle 
(or clock cycle) is a period of time in which a clock oscillates from low to high. The cycle then 
repeats. So, for instance, as shown below, a first cycle (dotted) is followed by a second cycle 
(solid), which is followed by a third cycle (dotted). 



At any given time, the system is in one of the cycles and it is not clear at what point the 
system is in between cycles. Since it is unknown when the system is between cycles, it is also 
unknown how to make a system which switches threads between cycles. Claims 2-16 and 18 are 
not enabled because they are dependent on a claim that is not enabled. Applicant is asked to 
clarify and make appropriate corrections. 

13. Claims 19-28 are rejected under 35 U.S.C. 1 12, first paragraph, as failing to comply with 
the enablement requirement. The claim(s) contains subject matter which was not described in 
the specification in such a way as to enable one skilled in the art to which it pertains, or with 
which it is most nearly connected, to make and/or use the invention. More specifically, 
regarding claim 19, the examiner does not understand how one could make or use an invention 
that switches threads between the end of an execution cycle and before the beginning of a 
next consecutive execution cycle. As is known, an execution cycle (or clock cycle) is a period 
of time in which a clock oscillates from low to high. The cycle then repeats. So, for instance, as 
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shown below, a first cycle (dotted) is followed by a second cycle (solid), which is followed by a 
third cycle (dotted). 



At any given time, the system is in one of the cycles and it is not clear at what point the 
system is in between cycles. Since it is unknown when the system is between cycles, it is also 
unknown how to make a system which switches threads between cycles. Claims 20-28 are not 
enabled because they are dependent on a claim that is not enabled. Applicant is asked to clarify 
and make appropriate corrections. 

14. The following is a quotation of the second paragraph of 35 U.S.C. 112: 

The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the 
subject matter which the applicant regards as his invention. 

15. Claims 5-12, 16-19, 26, 34-41, 45, and 53 are rejected under 35 U.S.C. 112, second 
paragraph, as being indefinite for failing to particularly point out and distinctly claim the subject 
matter which applicant regards as the invention. 

16. Claim 5 recites the limitation "said HRT thread" multiple times in the last paragraph. 
There is insufficient antecedent basis for this limitation in the claim because applicant claims "at 
least one HRT thread" in line 3, This means that multiple HRT threads may exists and therefore, 
it is not clear which HRT thread of the possible multiple HRT threads applicant is referring to in 
the last paragraph. For purposes of examination, "said HRT thread" will be interpreted as "an 
HRT thread". 
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17. Claim 16 recites the limitation "the fixed schedule" in line 1. There is insufficient 
antecedent basis for this limitation in the claim as the parent claim only refers to "an execution 
schedule". For purposes of examination, "the fixed schedule" will be interpreted as "the 
execution schedule". 

18. Claim 17 is rejected under 35 U.S.C. 112, second paragraph, as being indefinite for 
failing to particularly point out and distinctly claim the subject matter which applicant regards as 
the invention. Specifically, switching from the first thread state to the second thread state 
"between consecutive instruction cycles" is indefinite because it is unclear what applicant means 
by "between consecutive instruction cycles". As explained above, the system, at any point in 
time, is operating during an instruction cycle and therefore, it is not known when the system is 
between cycles. For purposes of examination, the examiner will interpret this to mean that the 
switching occurs without incurring a time penalty. The two are apparently equivalent according 
to the specification (page 17, lines 9-12). 

19. Claim 19 is rejected under 35 U.S.C. 112, second paragraph, as being indefinite for 
failing to particularly point out and distinctly claim the subject matter which applicant regards as 
the invention. Specifically, switching from the first thread state to the second thread state 
"between the end of an execution cycle and before the beginning of a next consecutive execution 
cycle" is indefinite because it is unclear what applicant means by "between". As explained 
above, the system, at any point in time, is operating during an execution cycle and therefore, it is 
not known when the system is between cycles. For purposes of examination, the examiner will 
interpret this to mean that the switching occurs without incurring a time penalty. The two are 
apparently equivalent according to the specification (page 17, lines 9-12). 
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20. Claim 26 recites the limitation "the HRT thread" and "said HRT thread" multiple times. 
There is insufficient antecedent basis for this limitation in the claim because applicant claims "at 
least one HRT thread" in parent claim 25. This means that multiple HRT threads may exists and 
therefore, it is not clear which HRT thread of the possible multiple HRT threads applicant is 
referring to in the last paragraph. For purposes of examination, "said HRT thread" will be 
interpreted as "an HRT thread". 

21. Claim 34 recites the limitation "said HRT thread" multiple times in the last paragraph. 
There is insufficient antecedent basis for this limitation in the claim because applicant claims "at 
least one HRT thread" in line 3. This means that multiple HRT threads may exists and therefore, 
it is not clear which HRT thread of the possible multiple HRT threads applicant is referring to in 
the last paragraph. For purposes of examination, "said HRT thread" will be interpreted as "an 
HRT thread". 

22. Claim 45 is unclear because it claims that the predetermined fixed schedule is either a 
fixed strict schedule (Fig. 7a of applicant's drawings), a semi-flexible strict schedule (Fig. 7b of 
applicant's drawings), or a loose strict schedule (Fig.7c of applicant's drawings). However, the 
predetermined fixed schedule is defined in claim 1 as a schedule which specifies that a first 
thread is to be allocated processing time every first number of cycles and a second thread is to be 
allocated processing time every second number of cycles, where the first and second numbers of 
cycles differ. It is not clear how the schedule of claim 1 is also a semi-flexible schedule because 
a first thread A is allocated processing time every first number of cycles and a second thread B is 
allocated processing time every second number of processing cycles where the first and second 
numbers are the same (every 4 cycles). The other threads in this schedule are not fixed and 
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therefore do not apply. Furthermore, it is not clear how the schedule of claim 1 is also a loose 
strict schedule because there is nothing fixed about this schedule, which is required by claim 1 . 
Therefore, the examiner asserts that the schedule of claim 1 can only be a fixed strict schedule. 

23. Claim 53 recites the limitation "the HRT thread" and "said HRT thread" multiple times. 
There is insufficient antecedent basis for this limitation in the claim because applicant claims "at 
least one HRT thread" in parent claim 52. This means that multiple HRT threads may exists and 
therefore, it is not clear which HRT thread of the possible multiple HRT threads applicant is 
referring to in the last paragraph. For purposes of examination, "said HRT thread" will be 
interpreted as "an HRT thread". 

24. Claims 6-12, 18, and 35-41 are rejected under 35 U.S.C. 112, 2 nd paragraph, as being 
indefinite, because they are dependent on claims which are indefinite. 

Claim Rejections - 35 USC § 102 

25. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 1 02 that form the 
basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by another filed 
in the United States before the invention by the applicant for patent or (2) a patent granted on an application for 
patent by another filed in the United States before the invention by the applicant for patent, except that an 
international application filed under the treaty defined in section 351(a) shall have the effects for purposes of this 
subsection of an application filed in the United States only if the international application designated the United 
States and was published under Article 21(2) of such treaty in the English language. 

26. Claims 1 , 29-33, 42, 43, and 45-47 are rejected under 35 U.S.C. 102(e) as being 
anticipated by Borkenhagen et al., U.S. Patent No. 6,076,157 (herein referred to as 
Borkenhagen). 



Application/Control Number: 09/748,098 Page 9 

Art Unit: 2183 

27. Referring to claim 1, Borkenhagen has taught a computer based system for switching 
between program contexts comprising: 

a) a processor capable of having a first program thread and a second program thread in an 
execution pipeline having a thread selection hardware. See Fig.4A and note that the processor 
includes thread switch (selection) hardware which selects between a first thread (thread 0) and a 
second thread (thread 1). Also, see column 7, lines 15-20, and note that the processor has an 
execution pipeline. 

b) a first set of data storage devices capable of storing a first thread state of said processor. See 
Fig.4A, component 442, and column 10, lines 18-56. Note that there is a first set of storage 
devices for storing a group of bits which represent the state of the first thread. 

c) a second set of data storage devices capable of storing a second thread state of said processor. 
See Fig.4A, component 444, and column 10, lines 18-56. Note that there is a second set of 
storage devices for storing a group of bits which represent the state of the second thread. 

d) a hardware thread scheduler for identifying which of said program threads said processor 
executes and configurable to allocate available processing time of the processor among at least 
the first and second program threads by causing thread-switching at a fixed time according to a 
predetermined fixed schedule, said schedule specifying that the first thread should be allocated 
processing time every first number of cycles and that the second thread should be allocated 
processing time every second number of cycles, wherein said first number of cycles is not equal 
to said second number of cycles. See column 13, line 20, to column 14, line 22, column 14, line 
63, to column 15, line 3, and claim 5 of Borkenhagen. In these passages it is taught that threads 
are scheduled with the help of a thread switch control register which includes bits that each 
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correspond to a different thread switch event. It is disclosed that any of the bits may be 
enabled/disabled to achieve switch flexibility for the associated thread. One of the bits is bit 9, 
which calls for switching when the thread switch time-out value is reached. Given two threads 
and given that bit 9 is enabled, a first thread will execute for X cycles before being switched, and 
then a second thread will execute for Y cycles before being switched, where X and Y are 
different. Then the process will repeat since there are only two threads in one embodiment 
(column 5, lines 7-14). For simplicity, assume that the first thread (thread A) has a timeout value 
of 4 cycles while the second thread (thread B) has a timeout value of 2 cycles. The execution of 
these threads would be as follows: 

AAAABBAAAABBAAAABB 

Note that thread A, when it is not executing, is allocated time every 2 cycles while thread 
B, when it is not running, is allocated time every 4 cycles. That is, when thread B is executing, 
after 2 cycles, thread A will be allocated 4 cycles of time. Likewise, when thread A is executing, 
after 4 cycles, thread B will be allocated 2 cycles of time. 

28. Referring to claim 29, Borkenhagen has taught a system as described in claim 1 . 
Borkenhagen has further taught that said thread selection hardware in the pipelined processor 
switches between said first and second thread state after the end of the execution of a first 
program instruction in the first thread and before the beginning of the execution of a second 
program instruction. This is deemed inherent because thread A will execute for some amount of 
time and then a switch will occur to another thread. The switching marks the end of executing 
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an instruction from thread A and the beginning of executing an instruction in thread B. And, 
clearly, the system must be in the second state before it can begin executing the second thread. 

29. Referring to claim 30, Borkenhagen has taught a system as described in claim 1 . 
Borkenhagen has further taught that said processor is an embedded pipelined processor. See 
claim 14 of Borkenhagen. The processor, which is pipelined, is embedded in a system. 

30. Referring to claim 3 1 , Borkenhagen has taught a system as described in claim 1 . 
Borkenhagen has further taught that said first state is the state of the processor during the 
execution of the first program thread. See column 10, lines 18-56. 

3 1 . Referring to claim 32, Borkenhagen has taught a system as described in claim 1 . 
Borkenhagen has further taught that said second state is the state of the processor during the 
execution of the second program thread. See column 10, lines 18-56. 

32. Referring to claim 33, Borkenhagen has taught a system as described in claim 1 . 
Borkenhagen has further taught that said processor switches between said first and second state 
by changing a state selection register. See column 15, lines 1-7. The decrement register (state 
selection register) is decremented each cycle until it gets to zero and then a thread switch occurs. 
Therefore, the processor switches between first and second state by changing a state selection 
register. 

33. Referring to claim 42, Borkenhagen has taught a system as described in claim 1 . 
Borkenhagen has further taught that said processor is capable of restoring said second state of 
said processor during execution of said first program thread. See Fig.4A, component 450, and 
note that at some point during execution of the first thread, a thread switch will occur, and the 
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second state is restored. That is, the switching to (restoring the state of) the second thread occurs 
during execution of the first thread. 

34. Referring to claim 43, Borkenhagen has taught a system as described in claim 1 . 
Borkenhagen has further taught that said processor is capable of storing said second thread state 
of said processor during execution of said first program thread. See column 13, lines 20-45. 
This control register (and control state) allows the system to specify which types of events would 
result in the switching of the associated thread, thereby increasing flexibility by allowing the user 
to choose how the thread may or may not be switched. This register may be set by any thread for 
itself or another thread. 

35. Referring to claim 45, Borkenhagen has taught a system as described in claim 1 . 
Borkenhagen has further taught that the predetermined fixed schedule is one of a fixed strict 
schedule, a semi-flexible strict schedule, and a loose strict schedule. Borkenhagen has taught at 
least a fixed strict schedule where threads are switched based on timeout values. For instance, 
see claim 5 of Borkenhagen. 

36. Referring to claim 46, Borkenhagen has taught a computer based method for switching 
between program contexts in a multithreading pipelined processor (see Fig.4A and the abstract) 
having a hardware thread selector (see Fig.4A) and an execution pipeline (see column 7, lines 
15-20 and note that the processor has an execution pipeline), the method comprising: 

a) storing a first context of said processor in a first set of data storage devices comprising a first 
thread state corresponding to a first program thread. See Fig.4A, component 442, and column 
10, lines 18-56. Note that there is a first set of storage devices for storing a group of bits which 
represent the state of the first thread. 
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b) storing a second context of said processor in a second set of data storage devices comprising a 
second thread state corresponding to a second program thread. See Fig.4A, component 444, and 
column 10, lines 18-56. Note that there is a second set of storage devices for storing a group of 
bits which represent the state of the second thread. 

c) switching the processor from the first thread state to the second thread state by coupling the 
execution pipeline from the first set of data storage devices to the second set of storage devices 
via the hardware thread selector at a fixed time according to a predetermined fixed execution 
schedule, said execution schedule specifying that the processor should switch to the first thread 
state every first number of cycles and that the processor should switch to the second thread state 
every second number of cycles, wherein said first number of cycles is not equal to said second 
number of cycles. See column 13, line 20, to column 14, line 22, column 14, line 63, to column 
15, line 3, and claim 5 of Borkenhagen. In these passages it is taught that threads are scheduled 
with the help of a thread switch control register which includes bits that each correspond to a 
different thread switch event. It is disclosed that any of the bits may be enabled/disabled to 
achieve switch flexibility for the associated thread. One of the bits is bit 9, which calls for 
switching when the thread switch time-out value is reached. Given two threads and given that 
bit 9 is enabled, a first thread will execute for X cycles (using its respective state storage - 
Fig.4A, component 442) before being switched, and then a second thread will execute for Y 
cycles (using its respective state storage - Fig.4A, component 442) before being switched, where 
X and Y are different. Then the process will repeat since there are only two threads in one 
embodiment (column 5, lines 7-14). For simplicity, assume that the first thread (thread A) has a 
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timeout value of 4 cycles while the second thread (thread B) has a timeout value of 2 cycles. The 
execution of these threads would be as follows: 

AAAABBAAAABBAAAABB 



Note that thread A, when it is not executing, is allocated time every 2 cycles while thread 
B, when it is not running, is allocated time every 4 cycles. That is, when thread B is executing, 
after 2 cycles, thread A will be allocated 4 cycles of time. Likewise, when thread A is executing, 
after 4 cycles, thread B will be allocated 2 cycles of time. 

37. Referring to claim 47, Borkenhagen has taught a method as described in claim 46. 
Borkenhagen has further taught that the switching comprises changing a state selection register 
included in the hardware thread selector. See column 15, lines 1-7. The decrement register 
(state selection register), which is part of the thread selector, is decremented each cycle until it 
gets to zero and then a thread switch occurs. Therefore, the processor switches between first and 
second states by changing a state selection register. 

Claim Rejections - 35 USC §103 

38. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 
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39. Claims 2-4, 13, 16-17, and 19-24 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Joy et al., U.S. Patent No. 6,542,991 (as applied in the previous Office Action 
and herein referred to as Joy), in view of Emer et al., U.S. Patent No. 6,493,741 (as applied in the 
previous Office Action and herein referred to as Emer). 

40. Referring to claim 17, Joy has taught a computer based system for switching between 
program contexts comprising: 

a) a pipelined processor (Fig.3, component 300; column 8, lines 14-67) capable of having a first 
program thread and a second program thread in an execution pipeline having a thread selection 
hardware (see Fig.6 and note the "thread select logic"; column 13 lines 5-23, column 15 lines 4- 
7), the execution pipeline including a set of stages for executing instructions and configured to 
execute a single instruction at each different stage in the set of stages (see claim 1, for instance, 
and note the existence of a pipeline). It should be noted that pipelines inherently include stages 
which perform different tasks, and in which a different instruction may be executing at one time. 

b) a first set of data storage devices capable of storing a first thread state of said pipelined 
processor. See Fig.3, component 310, and column 8, lines 27-44. Note that component 3 1 0 
includes flip-flops and a register file structure for storing a first thread state (for thread 0). 

c) a second set of data storage devices capable of storing a second thread state of said pipelined 
processor. See Fig.3, component 312, and column 8, lines 27-44. Note that component 312 
includes flip-flops and a register file structure for storing a second thread state (for thread 1). 

d) a hardware thread scheduler for identifying which of said program threads said pipelined 
processor executes (Fig.6; column 15, lines 4-7) and configurable to allocate available 
processing time of the pipelined processor among at least the first and second program threads 
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according to an execution schedule. See column 3, lines 33-51, and note that Joy's thread- 
switching may be of the oblivious type, in which threads are switched every N cycles without 
notification of stalling. 

e) Joy has not explicitly taught that said thread selection hardware in the pipelined processor 
switches from said first thread state to said second thread state between consecutive instruction 
cycles in response to the hardware thread scheduler identifying which of said program threads 
said pipelined processor executes. However, recall that Joy has taught that threads may be 
switched every N cycles. Since there is no disclosed restriction as to what value N might be, a 
thread switch may occur every cycle (N^l). When N=l, a fine-grained multithreaded system is 
achieved, as is known in the art. Emer has taught such a fine-grained system in which threads 
are switched every cycle (in consecutive cycles). See Fig. 1(b) and column 1, lines 52-65. Such 
a system eliminates vertical waste, thereby increasing throughput. Furthermore, in some 
instances, stalls for a particular thread are at least partially, and sometimes fully, masked. That 
is, if a thread may stall for a maximum period of X cycles, then if the system includes X threads, 
the stall will never affect the system. For example, assume that a system has 4 threads, and a 
thread may stall for a maximum of 4 cycles. In cycle 0, thread 0 must stall, in cycle 1, thread 1 
will execute, in cycle 2, thread 2 will execute, in cycle 3, thread 3 will execute, and then in cycle 
4, by the time thread 0 is to execute again, the stall time would have elapsed. The stall is 
effectively masked due to the switching every cycle. As a result, in order to eliminate vertical 
waste, increase throughput, and mask stalling in some instances, since Joy discloses that threads 
may be switched every N cycles, it would have been obvious to one of ordinary skill in the art at 
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the time of the invention to have N=l and have Joy switch threads on consecutive cycles, as 
taught by Emer. 

41. Referring to claim 2, Joy in view of Emer has taught a system as described in claim 17. 
Joy has further taught that said first thread state is the thread state of the processor during the 
execution of the first program thread. See Fig. 3, component 310, and column 8, lines 27-44. 
Note that component 310 includes flip-flops and a register file structure for storing a first thread 
state (for thread 0). 

42. Referring to claims 3, Joy in view of Emer has taught a system as described in claim 17. 
Joy has further taught that said second thread state is the thread state of the processor during the 
execution of the second program thread. See Fig.3, component 312, and column 8, lines 27-44. 
Note that component 312 includes flip-flops and a register file structure for storing a second 
thread state (for thread 1). 

43. Referring to claim 4, Joy in view of Emer has taught a system as described in claim 17. 
Joy has further taught that said processor switches between said first and second thread state by 
changing a state selection register. See Fig.5 and column 13, lines 5-64. The thread select logic 
includes a flip-flop for each thread, where the flip-flops collectively form a register that includes 
an active bit in the position corresponding to the active thread. 

44. Referring to claim 13, Joy in view of Emer has taught a system as described in claim 1 7. 
Joy has further taught that said processor is capable of restoring said second thread state of said 
processor during execution of said first program thread. See column 6, lines 1 5-35. Clearly, a 
thread is executed until a switch signal is given. Therefore, switching to (restoring a) thread 
occurs during execution of another thread. 
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45. Referring to claim 16, Joy in view of Emer has taught a system as described in claim 17. 
Joy has further taught that the fixed schedule is one of a fixed strict schedule, a semi-flexible 
strict schedule, and a loose strict schedule. From column 3, lines 28-51, switching every N 
cycles is considered a fixed strict schedule, i.e., a thread switch must occur every N cycles. 

46. Referring to claim 19, Joy has taught a computer based method for switching between 
program contexts in a multithreading pipelined processor (Fig. 3; column 8, lines 14-67) having a 
hardware thread selector (Fig. 6) and an execution pipeline (see claim 1 of Joy, for instance), the 
execution pipeline including a set of stages for executing instructions and configured to execute a 
single instruction at each different stage of the set of stages (it should be noted that pipelines 
inherently include stages which perform different tasks, and in which a different instruction may 
be executing at one time), the method comprising: 

a) storing a first context of said pipelined processor in a first set of data storage devices, the first 
context corresponding to a first program thread. See Fig.3, component 3 1 0, and column 8, lines 
27-44. Note that component 310 includes flip-flops and a register file structure for storing a first 
thread state (for thread 0). 

b) storing a second context of said pipelined processor in a second set of data storage devices, the 
second context corresponding to a second program thread. See Fig.3, component 312, and 
column 8, lines 27-44. Note that component 312 includes flip-flops and a register file structure 
for storing a second thread state (for thread 1). 

c) Joy has not explicitly taught switching the pipelined processor from executing the first 
program thread to executing the second program thread between the end of an execution cycle 
and before the beginning of a next consecutive execution cycle by coupling the execution 
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pipeline from the first set of data storage devices to the second set of storage devices via the 
hardware thread selector. However, recall that Joy has taught that threads may be switched 
every N cycles (column 3, lines 33-38). Since there is no disclosed restriction as to what value N 
might be, a thread switch may occur every cycle (N~l)* When N=l, a fine-grained 
multithreaded system is achieved, as is known in the art, Emer has taught such a fine-grained 
system in which threads are switched every cycle (in consecutive cycles). See Fig. 1(b) and 
column 1, lines 52-65. Such a system eliminates vertical waste, thereby increasing throughput. 
Furthermore, in some instances, stalls for a particular thread are at least partially, and sometimes 
fully, masked. That is, if a thread may stall for a maximum period of X cycles, then if the system 
includes X threads, the stall will never affect the system. For example, assume that a system has 
4 threads, and a thread may stall for a maximum of 4 cycles. In cycle 0, thread 0 must stall, in 
cycle 1, thread 1 will execute, in cycle 2, thread 2 will execute, in cycle 3, thread 3 will execute, 
and then in cycle 4, by the time thread 0 is to execute again, the stall time would have elapsed. 
The stall is effectively masked due to the switching every cycle. As a result, in order to 
eliminate vertical waste, increase throughput, and mask stalling in some instances, since Joy 
discloses that threads may be switched every N cycles, it would have been obvious to one of 
ordinary skill in the art at the time of the invention to have N=l and have Joy switch threads on 
consecutive cycles, as taught by Emer. 

47. Referring to claim 20, Joy in view of Emer has taught a system as described in claim 19. 
Joy has further taught that the switching comprises changing a state selection register included in 
the hardware thread selector. See Fig.5 and column 13, lines 5-64. The thread selector logic 
includes a flip-flop for each thread, where the flip-flops collectively form a register that includes 
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an active bit in the position corresponding to the active thread. When a thread is switched, then a 
new bit in the register would be set while the previously set bit is reset. 

48. Referring to claim 21, Joy in view of Emer has taught a method as described in claim 19. 
Joy has further taught identifying which of the said program threads said processor executes 
according to an execution schedule. See column 3, lines 33-51. 

49. Referring to claim 22, Joy in view of Emer has taught a method as described in claim 21 . 
Joy has further taught allocating available processing time of the processor among at least the 
first and second threads according to the execution schedule. See column 3, lines 33-51 . 

50. Referring to claim 23, Joy in view of Emer has taught a method as described in claim 22. 
Joy has further taught that the allocating comprises dividing the available execution time into a 
plurality of quanta, each quanta corresponding to a number of instruction cycles for execution of 
a thread. See column 3, lines 33-51, and note that each thread will execute for N cycles in one 
scheduling embodiment. 

5 1 . Referring to claim 24, Joy in view of Emer has taught a method as described in claim 23. 
Joy has further taught that at least one quanta corresponds to a thread that is scheduled to execute 
periodically after a fixed number of execution cycles. See column 3, lines 33-36. 

52. Claims 5-12, 18, and 25-28 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Joy in view of Emer and further in view of Ramakrishnan et al., U.S Patent No. 6,085,215 
(as applied in the previous Office Action and herein referred to as Ramakrishnan). 

53. Referring to claim 5, Joy in view of Emer has taught a system as described in claim 17. 
Joy in view of Emer has not taught that said hardware thread scheduler includes a thread 
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identifier for identifying at least one hard-real-time (HRT) thread and at least one non-real-time 
thread and a HRT scheduler for regularly scheduling said HRT thread in available time quanta 
such that said HRT thread is scheduled to ensure the execution of the HRT in a predetermined 
time. However, Ramakrishnan has taught such a concept. See column 4, line 52, to column 5, 
line 3, and the abstract. Note that real-time and general (non real-time) threads are determined 
and that a real time thread is scheduled during the available time quanta such that it executes in 
predetermined time. In such a system, real-time threads, which perform time-critical tasks, are 
given priority over general threads. This is clear because the general threads execute for a 
minimum time and then after that, if a real time thread needs processing, then it will preempt that 
general thread. Furthermore, Ramakrishnan has taught that such a system prevents starvation of 
threads (since all threads get some time to process) and offers a greater degree of fairness in 
allocating processing resources to various tasks. See column 4, lines 11-14. As a result, in order 
to execute time-critical tasks in a non-starving and fair way, it would have been obvious to one 
of ordinary skill in the art at the time of the invention to modify Joy to be used in a time-critical 
environment and to include at least one HRT thread and an HRT scheduler, as taught by 
Ramakrishnan. 

54. Referring to claim 6, Joy in view of Emer and further in view of Ramakrishnan has 
taught a system as described in claim 5. Ramakrishnan has further taught that said time quanta is 
at least one instruction cycle. See the abstract and note that real-time threads are scheduled for a 
preselected maximum amount of time. This time is inherently at least one cycle because if it 
were any less (zero cycles), then the thread would never execute. 



Application/Control Number: 09/748,098 Page 22 

Art Unit: 2183 

55. Referring to claim 7, Joy in view of Emer and further in view of Ramakrishnan has 
taught a system as described in claim 5. Ramakrishnan has further taught that said hardware 
thread scheduler schedules a non-real-time (NRT) thread to replace a scheduled HRT thread if 
said HRT is idle. See Fig.3, step 74, and note that if a real-time thread is idle (has no work), then 
the schedule is free to replace it with another thread, where another thread is either an NRT or 
another HRT. 

56. Referring to claim 8, Joy in view of Emer and further in view of Ramakrishnan has 
taught a system as described in claim 5. Ramakrishnan has further taught that said thread 
scheduler schedules the execution of non-real-time (NRT) threads in quanta not allocated to 
HRT threads. See the Fig.2A and note that a real-time thread is allocated time and that time is 
the real-time thread's time (step 52). After the HRT is done, an NRT may be scheduled. That is, 
they are both not scheduled in the same quanta (at the same time). The HRT is to execute 
(assuming the HRT is first to execute), and then the NRT is to execute. 

57. Referring to claim 9, Joy in view of Emer and further in view of Ramakrishnan has 
taught a system as described in claim 8. Ramakrishnan has further taught that said thread 
scheduler regularly schedules NRT threads to be executed. See the abstract and note that there 
may be a plurality of NRTs for scheduling. They are scheduled for minimum times throughout 
the entire execution process. 

58. Referring to claim 10, Joy in view of Emer and further in view of Ramakrishnan has 
taught a system as described in claim 5. Joy has further taught: 

a) a first storage device for storing program instructions, said processor fetching instructions 
from the first storage device within a first fetch period. See Fig.3, component 330. The 
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instruction cache (1$) will be fetched from during the time that the instructions needed are in the 
cache. 

b) a second storage device for storing program instructions, said processor fetching instructions 
from the second storage device within a second fetch period. See column 9, line 66. The main 
memory will be fetched from during the time that the instructions needed are not in the cache. 

c) wherein said first fetch period is substantially shorter than said second fetch period. Fetching 
from a cache is shorter than fetching from main memory, as is known in the art. 

59. Referring to claim 11, Joy in view of Emer and further in view of Ramakrishnan has 
taught a system as described in claim 10. Joy in view of Emer and further in view of 
Ramakrishnan has not taught that said first storage device for storing program instructions 
comprises a static RAM. However, Official Notice is taken that virtually all caches are 
implemented with static RAM (SRAM) and that SRAM and its advantages are well known and 
accepted in the art. SRAM is fast, which makes it suitable for caches, and unlike DRAM, it does 
not need to be refreshed in order to maintain its contents. Consequently, for speed and storage 
ability, it would have been obvious to one of ordinary skill in the art at the time of the invention 
to modify Joy's instruction cache such that it is implemented in SRAM. 

60. Referring to claim 12, Joy in view of Emer and further in view of Ramakrishnan has 
taught a system as described in claim 10. Joy in view of Emer and further in view of 
Ramakrishnan has not taught that said second storage device for storing program instructions 
comprises a flash memory. However, Official Notice is taken that flash-based main memories 
and their advantages are well known and accepted in the art. A computer hierarchy based upon 
volatile main memory loses all information in main memory when power is turned off. A flash- 
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based non-volatile main memory, however, reduces or eliminates the lengthy process of 
obtaining information from disk when power is turned on. Therefore flash main memory based 
computer system has higher system performance when a program is initially executed than 
would a volatile main memory based computer system. As a result, it would have been obvious 
to one of ordinary skill in the art at the time of the invention to modify Joy's main memory such 
that it is a flash-based main memory. 

61 . Referring to claim 18, Joy in view .of Emer and further in view of Ramakrishnan has 
taught a system as described in claim 5. Emer has further taught that said time quanta is exactly 
one instruction cycle. See Fig. 1(b), and column 1, lines 52-65. 

62. Referring to claim 25, Joy in view of Emer has taught a method as described in claim 21 . 
Joy in view of Emer has not taught identifying at least one hard real-time (HRT) thread and at 
least one non real-time (NRT) thread.. However, Ramakrishnan has taught the concept of real- 
time threads and general (non-real-time). See column 4, line 52, to column 5, line 3, and the 
abstract. Note that real-time threads are identified and are those that perform time-critical work 
and therefore should be given priority in the system over general threads. This priority concept 
is clear in Ramakrishnan because the general threads execute for a minimum time and then after 
that, if a real time thread needs processing, then it will preempt that general thread. Furthermore, 
Ramakrishnan has taught that such a system gets important work done while preventing 
starvation of threads (since all threads get some time to process) and offering a greater degree of 
fairness in allocating processing resources to various tasks. See column 4, lines 11-14. As a 
result, in order to execute time-critical tasks in a non-starving and fair way, it would have been 
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obvious to one of ordinary skill in the art at the time of the invention to modify Joy to identify at 
least one HRT and at least one NRT thread, as taught by Ramakrishnan. 

63. Referring to claim 26, Joy in view of Emer and further in view of Ramakrishnan has 
taught a method as described in claim 25. Ramakrishnan has further taught scheduling the HRT 
thread in available time quanta such that said HRT thread is scheduled to ensure the execution of 
the HRT thread within a predetermined time. See column 4, line 52, to column 5, line 3, and the 
abstract. Note that HRT threads are given a maximum time in which to execute. This ensures 
the execution of the HRT within that maximum time. 

64. Referring to claim 27, Joy in view of Emer and further in view of Ramakrishnan has 
taught a method as described in claim 25. Ramakrishnan has further taught scheduling an NRT 
thread for a quantum allocated for an HRT thread if said HRT thread is idle. See Fig. 3, step 74, 
and note that if a real-time thread is idle (has no work), then the schedule is free to replace it with 
another thread, where another thread is either an NRT or another HRT. 

65. Referring to claim 28, Joy in view of Emer and further in view of Ramakrishnan has 
taught a method as described in claim 25. Ramakrishnan has further taught scheduling NRT 
threads in quanta not allocated for HRT threads. See the Fig.2A and note that a real-time thread 
is allocated time and that time is the real-time thread's time (step 52). After the HRT is done, an 
NRT may be scheduled. That is, they are both not scheduled in the same quanta (at the same 
time). The HRT is to execute (assuming the HRT is first to execute), and then the NRT is to 
execute. 
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66. Claim 14 is rejected under 35 U.S.C. 103(a) as being unpatentable over Joy in view of 
Emer and further in view of Borkenhagen. 

67. Referring to claim 14, Joy in view of Emer has taught a system as described in claim 17. 
Joy in view of Emer has not explicitly taught that said processor is capable of storing said second 
thread state of said processor during execution of said first program thread. However, 
Borkenhagen has taught that a thread switch control register may be implemented for each thread 
for holding a state of that thread and that the control state for the second thread may be stored 
during execution of the first thread. See column 13, lines 20-45. This control register (and 
control state) allows the system to specify which types of events would result in the switching of 
the associated thread, thereby increasing flexibility by allowing the user to choose how the 
thread may or may not be switched. Consequently, it would have been obvious to one of 
ordinary skill in the art at the time of the invention to modify Joy in view of Emer to include the 
control register of Borkenhagen as part of the thread state, where the thread state of the second 
thread is stored during execution of the first thread. 

68. Claim 15 is rejected under 35 U.S.C. 103(a) as being unpatentable over Joy in view of 
Emer and further in view of Levy et al., U.S. Patent No. 6,314,51 1 (herein referred to as Levy). 

69. Referring to claim 15, Joy in view of Emer has taught a system as described in claim 17. 
While Joy has taught that said first set of data storage devices comprises registers (see column 8, 
lines 27-44, and column 3, lines 3-10), Joy has not taught that the registers are shared by a 
plurality of threads. Instead, Joy has taught that each thread gets its own register file. However, 
Levy has taught that some tests have shown that shared registers provide performance gains 
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when compared to dedicated per-thread register designs, as taught by Joy. In addition, by 
sharing registers, the total size of the register file is reduced (since you don't have to have a 
separate register file for each thread) without sacrificing performance. See Levy, column 8, line 
65, to column 9, line 3. As a result, it would have been obvious to one of ordinary skill in the art 
at the time of the invention to modify Joy such that the registers are shared by the threads instead 
of replicated. 

70. Claims 34-41 and 48-55 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Borkenhagen in view of Ramakrishnan. 

71 . Referring to claim 34, Borkenhagen has taught a system as described in claim 1 . 
Borkenhagen has not taught that said hardware thread scheduler includes a thread identifier for 
identifying at least one hard-real-time (HRT) thread and at least one non-real-time thread and a 
HRT scheduler for regularly scheduling said HRT thread in available time quanta such that said 
HRT thread is scheduled to ensure the execution of the HRT in a predetermined time. However, 
Ramakrishnan has taught such a concept. See column 4, line 52, to column 5, line 3, and the 
abstract. Note that real-time and general (non real-time) threads are determined and that a real 
time thread is scheduled during the available time quanta such that it executes in predetermined 
time (i.e., in a preselected maximum time). In such a system, real-time threads, which perform 
time-critical tasks, are given priority over general threads. This is clear because the general 
threads execute for a minimum time and then after that, if a real time thread needs processing, 
then it will preempt that general thread. Furthermore, Ramakrishnan has taught that such a 
system prevents starvation of threads (since all threads get some time to process) and offers a 
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greater degree of fairness in allocating processing resources to various tasks. See column 4, lines 
11-14. As a result, in order to execute time-critical tasks in a non-starving and fair way, it would 
have been obvious to one of ordinary skill in the art at the time of the invention to modify 
Borkenhagen be used in a time-critical environment and to include at least one HRT thread and 
an HRT scheduler, as taught by Ramakrishnan. 

72. Referring to claim 35, Borkenhagen in view of Ramakrishnan has taught a system as 
described in claim 34. Ramakrishnan has further taught that said time quanta is at least one 
instruction cycle. See the abstract and note that real-time threads are scheduled for a preselected 
maximum amount of time. This time is inherently at least one cycle because if it were any less 
(zero cycles), then the thread would never execute. 

73. Referring to claim 36, Borkenhagen in view of Ramakrishnan has taught a system as 
described in claim 34. Ramakrishnan has further taught that said hardware thread scheduler 
schedules a non-real-time (NRT) thread to replace a scheduled HRT thread if said HRT is idle. 
See Fig. 3, step 74, and note that if a real-time thread is idle (has no work), then the schedule is 
free to replace it with another thread, where another thread is either an NRT or another HRT. 

74. Referring to claim 37, Borkenhagen in view of Ramakrishnan has taught a system as 
described in claim 34. Ramakrishnan has further taught that said hardware thread scheduler 
schedules the execution of non-real-time (NRT) threads in quanta not allocated to HRT threads. 
See the Fig.2A and note that a real-time thread is allocated time and that time is the real-time 
thread's time (step 52). After the HRT is done, an NRT may be scheduled. That is, they are 
both not scheduled in the same quanta (at the same time). The HRT is to execute (assuming the 
HRT is first to execute), and then the NRT is to execute. 



Application/Control Number: 09/748,098 Page 29 

Art Unit: 2183 

75. Referring to claim 38, Borkenhagen in view of Ramakrishnan has taught a system as 
described in claim 37. Ramakrishnan has further taught that said hardware thread scheduler 
regularly schedules NRT threads to be executed. See the abstract and note that there may be a 
plurality of NRTs for scheduling. 

76. Referring to claim 39, Borkenhagen in view of Ramakrishnan has taught a system as 
described in claim 34. Borkenhagen has further taught: 

a) a first storage device for storing program instructions, said processor fetching instructions 
from the first storage device within a first fetch period. See Fig. 1 , component 150. The cache 
will be fetched from during the time that the instructions needed are in the cache. 

b) a second storage device for storing program instructions, said processor fetching instructions 
from the second storage device within a second fetch period. See Fig.l, component 140. The 
main memory will be fetched from during the time that the instructions needed are not in the 
cache. 

c) wherein said first fetch period is substantially shorter than said second fetch period. See 
column 3, lines 13-17. Fetching from a cache is shorter than fetching from main memory. 

77. Referring to claim 40, Borkenhagen in view of Ramakrishnan has taught a system as 
described in claim 39. Borkenhagen has not taught that said first storage device for storing 
program instructions comprises a static RAM. However, Official Notice is taken that virtually 
all caches are implemented with static RAM (SRAM) and that SRAM and its advantages are 
well known and accepted in the art. SRAM is fast, which makes it suitable for caches, and 
unlike DRAM, it does not need to be refreshed in order to maintain its contents. Consequently, 
for speed and storage ability, it would have been obvious to one of ordinary skill in the art at the 
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time of the invention to modify Borkenhagen's instruction cache such that it is implemented in 
SRAM. 

78. Referring to claim 41, Borkenhagen in view of Ramakrishnan has taught a system as 
described in claim 39. Borkenhagen has not taught that said second storage device for storing 
program instructions comprises a flash memory. However, Official Notice is taken that flash- 
based main memories and their advantages are well known and accepted in the art. A computer 
hierarchy based upon volatile main memory loses all information in main memory when power 
is turned off. A flash-based non-volatile main memory, however, reduces or eliminates the 
lengthy process of obtaining information from disk when power is turned on. Therefore flash 
main memory based computer system has higher system performance when a program is initially 
executed than would a volatile main memory based computer system. As a result, it would have 
been obvious to one of ordinary skill in the art at the time of the invention to modify 
Borkenhagen's main memory such that it is a flash-based main memory. 

79. Referring to claim 48, Borkenhagen has taught a method as described in claim 46. 
Borkenhagen has not taught identifying which of the said program threads said processor 
executes according to a hard real-time (HRT) execution schedule. However, Ramakrishnan has 
taught the concept of real-time threads. See column 4, line 52, to column 5, line 3, and the 
abstract. Note that real-time threads are identified and are those that perform time-critical work 
and therefore should be given priority in the system. In such a system, real-time threads are 
given priority over general threads (note that Borkenhagen has also taught using priority with 
threads in the abstract). This priority concept is clear in Ramakrishnan because the general 
threads execute for a minimum time and then after that, if a real time thread needs processing, 
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then it will preempt that general thread. Furthermore, Ramakrishnan has taught that such a 
system prevents starvation of threads (since all threads get some time to process) and offers a 
greater degree of fairness in allocating processing resources to various tasks, while allowing 
important work to get done. See column 4, lines 11-14. As a result, in order to execute time- 
critical tasks in a non-starving and fair way, it would have been obvious to one of ordinary skill 
in the art at the time of the invention to modify Borkenhagen to be used in a time-critical 
environment and to identify which of the program threads the processor executes according to an 
HRT execution schedule, as taught by Ramakrishnan. 

80. Referring to claim 49, Borkenhagen in view of Ramakrishnan has taught a method as 
described in claim 48. Borkenhagen has further taught allocating available processing time of 
the processor among at least the first and second threads according to the predetermined fixed 
execution schedule. See claim 5 of Borkenhagen and recall that a first thread is executed for a 
certain amount of time and then a second thread is executed for a certain amount of time. Given 
only two threads, this cycle will repeat. 

81 . Referring to claim 50, Borkenhagen in view of Ramakrishnan has taught a method as 
described in claim 49. Borkenhagen has further taught that the allocating comprises dividing the 
available execution time into a plurality of quanta, each quanta corresponding to a number of 
instruction cycles for execution of a thread. Again, see claim 5 of Borkenhagen. Each thread is 
allocated a number of cycles (quanta) in which to execute. This number is associated with the 
timeout value. The overall processing time will be divided into an amount of time for the first 
thread and an amount of time for the second thread. 
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82. Referring to claim 51, Borkenhagen in view of Ramakrishnan has taught a method as 
described in claim 50. Borkenhagen has further taught that at least one quantum corresponds to 
a thread that is scheduled to execute periodically after a fixed number of execution cycles. A 
thread in Borkenhagen will execute after a fixed number of cycles in which another thread 
executes. For instance, thread A might execute for 10 cycles, and thread B might execute for 5 
cycles. Five cycles into thread B's execution, thread A will be switched in. This happens every 
time the system gets 5 cycles into thread B's execution (it's fixed). 

83. Referring to claim 52, Borkenhagen in view of Ramakrishnan has taught a method as 
described in claim 48. Ramakrishnan has further taught that identifying further comprises 
identifying at least one hard real-time (HRT) thread and at least one non real-time (NRT) thread. 
See column 4, line 52, to column 5, line 3, and the abstract. Note that real-time threads are 
identified and are those that perform time-critical work and therefore should be given priority in 
the system over general threads, which are the less time-critical threads. 

84. Referring to claim 53, Borkenhagen in view of Ramakrishnan has taught a method as 
described in claim 52. Ramakrishnan has further taught scheduling the HRT thread in available 
time quanta such that said HRT thread is scheduled to ensure the execution of the HRT thread 
within a predetermined time. See column 4, line 52, to column 5, line 3, and the abstract. Note 
that HRT threads are given a maximum time in which to execute. This ensures the execution of 
the HRT within that maximum time. 

85. Referring to claim 54, Borkenhagen in view of Ramakrishnan has taught a method as 
described in claim 52. Ramakrishnan has further taught scheduling an NRT thread for a 
quantum allocated for an HRT thread if said HRT thread is idle. See Fig.3, step 74, and note that 
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if a real-time thread is idle (has no work), then the schedule is free to replace it with another 
thread, where another thread is either an NRT or another HRT. 

86. Referring to claim 55, Borkenhagen in view of Ramakrishnan has taught a method as 
described in claim 52. Ramakrishnan has further taught scheduling NRT threads in quanta not 
allocated for HRT threads. See the Fig.2A and note that a real-time thread is allocated time and 
that time is the real-time thread's time (step 52). After the HRT is done, an NRT may be 
scheduled. That is, they are both not scheduled in the same quanta (at the same time). The HRT 
is to execute (assuming the HRT is first to execute), and then the NRT is to execute. 

87. Claim 44 is rejected under 35 U.S.C. 103(a) as being unpatentable over Borkenhagen in 
view of Levy. 

88. Referring to claim 44, Borkenhagen has taught a system as described in claim 1 . While 
Borkenhagen has hinted at sharing of resources (column 5, lines 56-57), Borkenhagen has not 
explicitly taught that said first set of storage devices comprises registers shared by a plurality of 
threads. However, Levy has taught that some tests have shown that shared registers provide 
performance gains when compared to dedicated per-thread register designs. In addition, by 
sharing registers, the total size of the register file is reduced (since you don't have to have a 
separate register file for each thread) without sacrificing performance. See Levy, column 8, line 
65, to column 9, line 3. As a result, it would have been obvious to one of ordinary skill in the art 
at the time of the invention to modify Borkenhagen such that the registers are shared by the 
threads. 
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Response to Arguments 

89. Applicant's arguments with respect to claims 1 and 46 (pages 14-18 of the remarks) have 
been considered but are moot in view of the new ground(s) of rejection. 

90. Applicant argues the novelty/rejection of claims 17 and 19, on page 20 of the remarks, in 
substance that: 

"As clearly illustrated in Figure 1, the architectures, disclosed by Emer are configured to receive 
multiple instructions in a single processor cycle for the same stage, the issue stage. Therefore, at 
least the issue stage of Emer operates on multiple instructions in a single cycle. Emer does not 
disclose a pipelined processor that executes a single instruction in each stage of the pipeline as 
recited in the claims, and Emer certainly does not disclose switching such a pipelined processor 
between consecutive instruction cycles." 

91. These arguments are not found persuasive for the following reasons: 

a) As is known, a pipeline executes a single instruction per stage. If there are 5 stages in a 
pipeline, then 5 instructions may be executed at a time (one per stage). Looking at Emer, Fig. 2, 
each column is a stage of a different pipeline so a pipeline still executes a single instruction. 
However, even if all of the columns together were considered a pipeline, the pipeline still 
executes a single instruction in the stage. It executes a single instruction and additional 
instructions, which is a valid interpretation of the claim due to the "comprising" language. 
Furthermore, halfway down in Fig.2b s it is shown that the pipeline is executing no more than one 
instruction, and so again, it has been taught that a single instruction is executed by the pipeline in 
a single cycle. Regardless, it should be realized that the purpose of using Emer is to show that 
threads may be switched every cycle (between instructions) and the resulting benefit of such a 
concept is reduced vertical waste. 
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92. Applicant argues the novelty/rejection of claims 17 and 19, on page 21 of the remarks, in 
substance that: 

"It is not apparent how one of ordinary skill in the art would combine the single-processor 
vertically-threaded processor of Joy with the superscalar, multithreading, and simultaneous 
multithreading architectures of Emer. The single-processor vertically-threaded processor of Joy 
has a single pipeline shared among a plurality of threads. The thread switch logic of Joy selects 
at most one thread at any given time as the active thread, and the "currently active 
thread... supplies data to functional blocks connected to the pipeline." (See Joy Col. 8, 1 1. 19-25.) 
The architectures illustrated in Emer are capable of receiving multiple instructions in a single 
clock cycle. Therefore, the thread switch logic of Joy, which merely activates one thread for 
execution on the functional blocks of the pipeline, would not be able to switch the superscalar, 
multithreading, and simultaneous multithreading processors of Emer." 

93. These arguments are not found persuasive for the following reasons: 

a) Again, the exact pipeline structure of Emer is not necessarily important, nor is it being relied 
upon, in the combination. The idea extracted from Emer is merely that threads may be switched 
every cycle. Whatever allows threads to be switched every cycle in Emer would be implemented 
in Joy as switching threads every cycle is not tied to a specific type of pipeline structure. The 
reason for switching every cycle is to eliminate vertical waste. In Joy, a thread is executed by 
the pipeline for N cycles, with no limit on N. The examiner is merely stating that it would be 
obvious for N to be 1, as taught by Emer, in order to reduce vertical waste. This just means that 
Joy's pipeline would execute a different thread every cycle. 



94. Applicant argues the novelty/rejection of claims 1 7 and 19, on page 22 of the remarks, in 
substance that: 

"Moreover, Joy explicitly states that an overhead delay of one or more clock cycles is necessary 
for the single-processor vertically-threaded processor. (See col. 16, 1 1. 1-5 and 1 1.61-62.) Joy 
teaches that such a delay is necessary to switch the state of the processor. Therefore, the 
single-processor vertically-threaded processor of Joy is fundamentally incompatible with any 
switching that causes the processor to switch threads every processor cycle. Switching every 
cycle with a one cycle overhead would prohibit any forward execution progress, rendering the 
proposed combination inoperable for its intended purpose." 
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95. These arguments are not found persuasive for the following reasons: 

a) As described above, the idea extracted from Emer is merely that threads may be switched 
every cycle. Whatever logic allows threads to be switched every cycle in Emer would be 
implemented in Joy in order to achieve a thread switch every cycle. Clearly, Joy's logic would 
be replaced because Joy's logic causes overhead, as pointed out in applicant's argument, in while 
Emer's logic eliminates vertical waste, which is advantageous for throughput. 

96. In the first two paragraphs on page 25 of the remarks, applicant argues that Borkenhagen 
and Ramakrishnan do not disclose the deficiencies of Joy and Joy in view of Emer. These 
arguments have been found non-persuasive and the reasons why are evident in the rejections 
above (since applicant feels that the deficiencies are disclosed). Therefore, applicant's attention 
is directed to the rejections in which Borkenhagen and Ramakrishnan are combined with Joy 
and/or Emer. 

97. In the last paragraph on page 25 of the remarks, argues the use of the Gutgold reference. 
This argument has been considered but is moot in view of the new ground(s) of rejection. 

Conclusion 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to David J. Huisman whose telephone number is (571) 272-4168. 
The examiner can normally be reached on Monday-Friday (8:00-4:30). 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 



supervisor, Eddie Chan can be reached on (571) 272-4162. The fax phone number for the 



organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the. Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

DJH 

David J. Huisman 
October 25, 2006 
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